Search CORE

151 research outputs found

Comparing ultra-high spatial resolution remote-sensing methods in mapping peatland vegetation

Author: Arroyo‐Mora J. P.
Bray J. R.
Caliński T.
Hill M. O.
Liaw A.
Lovitt J.
Ridgeway G.
Rouse J. W. J.
Publication date: 01/09/2019

Peer reviewed

Crossref

Helsingin yliopiston digitaalinen arkisto

Parallel Mapper

Author: A Collins
D Günther
E Carlsson
G Carlsson
G Carlsson
G Carlsson
G Carlsson
J-D Boissonnat
JR Munkres
LW Beineke
M Nicolau
N Otter
N Shivashankar
PY Lum
R Ghrist
RW Sumner
T Caliński
U Bauer
V Pascucci
V Robins
V Snášel
Y Hiraoka
Publication date: 11/05/2020

The construction of Mapper has emerged in the last decade as a powerful and effective topological data analysis tool that approximates and generalizes other topological summaries, such as the Reeb graph, the contour tree, and split and join trees. In this paper, we study the parallel analysis of the construction of Mapper. We give a provably correct parallel algorithm to execute Mapper on multiple processors and discuss performance results that compare our approach to a reference sequential Mapper implementation. We report performance experiments that demonstrate the efficiency of our method.

arXiv.org e-Print Archive

Crossref

MaxMin Linear Initialization for Fuzzy C-Means

Author: AM Bensaid
D Steinley
DJ Hand
EH Ruspini
GN Lance
HS Park
J. C. Dunn
JC Bezdek
ME Celebi
MJ Norušis
NR Pal
S Wold
T Caliński
T Su
TF Gonzalez
V Faber
W Wang
XL Xie
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 14/07/2018

Clustering is an extensive research area in data science. The aim of clustering is to discover groups and to identify interesting patterns in datasets. Crisp (hard) clustering considers that each data point belongs to one and only one cluster. However, it is inadequate, as some data points may belong to several clusters, as is the case in text categorization. Thus, we need more flexible clustering. Fuzzy clustering methods, where each data point can belong to several clusters, are an interesting alternative. Yet seeding iterative fuzzy algorithms to achieve high-quality clustering is an issue. In this paper, we propose a new linear and efficient initialization algorithm, MaxMin Linear, to deal with this problem. Then, we validate our theoretical results through extensive experiments on a variety of numerical real-world and artificial datasets. We also test several validity indices, including a new validity index that we propose, Transformed Standardized Fuzzy Difference (TSFD).

arXiv.org e-Print Archive

Crossref

HAL

Hal-Diderot

Contextual and Behavioral Customer Journey Discovery Using a Genetic Approach

Author: A Gabadinho
B Vázquez-Barreiros
G Bernard
İ Gürvardar
KN Lemon
S Peltola
T Caliński
VI Levenshtein
WMP Aalst van der
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019

With the advent of new technologies and the increase in customers’ expectations, services are becoming more complex. This complexity calls for new methods to understand, analyze, and improve service delivery. Summarizing customers’ experience using representative journeys that are displayed on a Customer Journey Map (CJM) is one of these techniques. We propose a genetic algorithm that automatically builds a CJM from raw customer experience recorded in a database. Mining representative journeys can be seen as a clustering task where both the sequence of activities and some contextual data (e.g., demographics) are considered when measuring the similarity between journeys. We show that our genetic approach outperforms traditional ways of handling this clustering task. Moreover, we apply our algorithm to a real dataset to highlight the benefit of using a genetic approach.

Crossref

Serveur académique lausannois

CJM-ab: Abstracting Customer Journey Maps Using Process Mining

Author: A Gabadinho
G Schwarz
J Vanhatalo
JC Buijs
KN Lemon
SJJ Leemans
SJJ Leemans
SJJ Leemans
T Caliński
W Aalst van der
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018

Crossref

Serveur académique lausannois

A Method for Estimating the Efficiency of Commanding in the State Fire Service of Poland

Author: A Krasuski
Adam Krasuski
J Rahikainen
JP Royston
JP Royston
Karol Kreński
M Laan Van der
PJ Rousseeuw
S Deerwester
SCK Shiu
Stanisław Łazowy
T Caliński
TK Landauer
Publication venue: 'Springer Science and Business Media LLC'

Crossref

How (not) to measure bias in face recognition networks

Author: A Norval
B Yu
DF Smith
DJ Robertson
G Guo
JC Dunn
K Zhang
L Royakkers
M Alvi
M Amirian
M Hashemi
M Mann
M Merler
P Bernal
PJ Rousseeuw
R Bellman
R Rothe
S Li
T Caliński
T Stadelmann
Y Guo
Y Li
Publication venue: Springer
Publication date: 02/09/2020

In recent years, Face Recognition (FR) systems have achieved human-like (or better) performance, leading to extensive deployment in large-scale practical settings. Yet, especially for sensitive domains such as FR, we expect algorithms to work equally well for everyone, regardless of somebody's age, gender, skin colour and/or origin. In this paper, we investigate a methodology to quantify the amount of bias in a trained Convolutional Neural Network (CNN) model for FR that is not only intuitively appealing, but also has already been used in the literature to argue for certain debiasing methods. It works by measuring the "blindness" of the model towards certain face characteristics in the embeddings of faces based on internal cluster validation measures. We conduct experiments on three openly available FR models to determine their bias regarding race, gender and age, and validate the computed scores by comparing their predictions against the actual drop in face recognition performance for minority cases. Interestingly, we could not link a crisp clustering in the embedding space to a strong bias in recognition rates; it is rather the opposite. We therefore offer arguments for the reasons behind this observation and argue for the need of a less naive clustering approach to develop a working measure for bias in FR models.

Crossref

ZHAW digitalcollection

A cluster-based approach to selecting representative stimuli from the International Affective Picture System (IAPS) for emotion elicitation

Author: A Mehrabian
Adam Moore
AK Jain
Alexandra C. Constantinescu
B Deen
C Fraley
C Leys
C Lithari
C Xing
CB Do
CR Glenn
D Borcard
D Grühn
D Watson
E Anderson
E Bernat
E Dimitriadou
EY Mun
F Aguilar de Arcos
GH Ball
HW Koenigsberg
J Wu
JA Hartigan
JA Hartigan
JA Mikels
JA Mikels
JA Russell
JC Tomaszczyk
JE LeDoux
JF Stins
K Hornik
L Hubert
L Kaufman
M Eizenman
M Hallahan
M Kantardzic
M Meilă
Maria Wolters
MM Bradley
MM Bradley
PJ Lang
RD Lane
RL Perri
S Delplanque
S Dolnicar
Sarah E. MacPherson
SB Hamann
SB Most
T Caliński
TA Ito
X Zhang
X Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

Crossref

Springer - Publisher Connector

Edinburgh Research Explorer

Classification of frequency response areas in the inferior colliculus reveals continua not discrete classes

Author: Adrian Rees
Aitkin LM
Alan R. Palmer
Caliński T
Cant N
Christian J. Sumner
Davis KA
Kiang NYS
LeBeau FEN
LeBeau FEN
Malmierca MS
Oliver DL
Oliver Zobay
Palombi PS
Rhode WS
Rose J
Rose JE
Trevor M. Shackleton
Xu R
Yang LC
Publication venue: 'Wiley'
Publication date: 10/06/2013

A differential response to sound frequency is a fundamental property of auditory neurons. Frequency analysis in the cochlea gives rise to V-shaped tuning functions in auditory nerve fibres, but by the level of the inferior colliculus (IC), the midbrain nucleus of the auditory pathway, neuronal receptive fields display diverse shapes that reflect the interplay of excitation and inhibition. The origin and nature of these frequency receptive field types are still open to question. One proposed hypothesis is that the frequency response class of any given neuron in the IC is predominantly inherited from one of three major afferent pathways projecting to the IC, giving rise to three distinct receptive field classes. Here, we applied subjective classification, principal component analysis, cluster analysis, and other objective statistical measures to a large population (2826) of frequency response areas from single neurons recorded in the IC of the anaesthetised guinea pig. Subjectively, we recognised seven frequency response classes (V-shaped, non-monotonic Vs, narrow, closed, tilt down, tilt up and double-peaked) that were represented at all frequencies. We could identify similar classes using our objective classification tools. Importantly, however, many neurons exhibited properties intermediate between these classes, and none of the objective methods used here showed evidence of discrete response classes. Thus, receptive field shapes in the IC form continua rather than discrete classes, a finding consistent with the integration of afferent inputs in the generation of frequency response areas. The frequency disposition of inhibition in the response areas of some neurons suggests that across-frequency inputs originating at or below the level of the IC are involved in their generation.

Crossref

Nottingham Trent Institutional Repository (IRep)

PubMed Central

Clustering Algorithms: Their Application to Gene Expression Data

Author: Agrawal R.
Alizadeh A.A.
Bandyopadhyay S.
Bandyopadhyay S.
Bezdek J.C.
Bezdek J.C.
Bezdek J.C.
Bhargavi M.S.
Blatt M.
Bochkov Y.A.
Brunet J.P.
Bryan K.
Buitinck L.
Bunnik E.M.
Caliński T.
Chandrasekhar T.
Cheng Y.
Costa I.G.
Cover T.M.
D'haeseleer P.
Dave R.N.
Davies D.L.
De Morsier F.
Dempster A.P.
Dharmarajan A.
Dhillon I.S.
Divina F.
Do C.B.
Domany E.
Du Z.
Dunn J.C.
Edla D.R.
Eisen M.B.
Ferguson T.S.
Frey B.J.
Fu L.
Fukuyama Y.
Galluccio L.
Gath I.
Getz G.
Gordon G.J.
Gu J.
Guha S.
Handhayani T.
Handl J.
Hatamlou A.
Heard N.A.
Heyer L.J.
Hinneburg A.
Hinneburg A.
Hu X.
Hubert L.J.
Jain A.K.
Jiang D.
Jiang H.
Joopudi S.
Kao Y.T.
Karmilasari S.W.
Karypis G.
Kaufman L.
Kerr G.
Kluger Y.
Kohonen T.
Kohonen T.
Krzanowski W.J.
Leone M.
Lu Y.
Lu Y.
Ma'sum M.A.
MacQueen J.
Madeira S.C.
Mann A.K.
Masciari E.
Maulik U.
Milligan G.W.
Mitra S.
Moon T.K.
Moore W.C.
Müllner D.
Nagpal A.
Nasser S.
Neal R.M.
Ng R.T.
Pakhira M.K.
Pal N.R.
Pedregosa F.
Pirim H.
Pitman J.
Prelić A.
Qin Z.S.
Raman S.
Rasmussen C.E.
Rezaee B.
Rezaee M.R.
Ruspini E.H.
Saha S.
Saha S.
Saha S.
Sathishkumar K.
Sheikholeslami G.
Sheng Q.
Sirinukunwattana K.
Sokal R.R.
Sun J.
Talaat A.M.
Tamayo P.
Tanay A.
Tang C.
Thalamuthu A.
Tibshirani R.
Wan M.
Wang L.
Wang W.
Williams G.
Wu J.
Wu K.L.
Wu S.
Xie X.L.
Xu R.
Xu Y.
Yu H.
Zhang D.
Zhang T.
Zhang Y.
Zhang Z.Y.
Zhao L.
Zhong C.
Zitnik M.
Řehůřek R.
Publication venue: 'SAGE Publications'
Publication date: 01/01/2016

Gene expression data hide vital information required to understand the biological processes that take place in a particular organism in relation to its environment. Deciphering the hidden patterns in gene expression data offers a tremendous opportunity to strengthen the understanding of functional genomics. The complexity of biological networks and the volume of genes present increase the challenges of comprehending and interpreting the resulting mass of data, which consists of millions of measurements; these data also exhibit vagueness, imprecision, and noise. Therefore, the use of clustering techniques is a first step toward addressing these challenges, and it is essential in the data mining process to reveal natural structures and identify interesting patterns in the underlying data. The clustering of gene expression data has proven useful in revealing the natural structure inherent in gene expression data, understanding gene functions, cellular processes, and subtypes of cells, mining useful information from noisy data, and understanding gene regulation. The other benefit of clustering gene expression data is the identification of homology, which is very important in vaccine design. This review examines the various clustering algorithms applicable to gene expression data in order to discover and provide useful knowledge of the appropriate clustering technique that will guarantee stability and a high degree of accuracy in its analysis procedure.

Covenant University Repository

Crossref

Directory of Open Access Journals

PubMed Central